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Abstract 

We introduce two new complexity measures for Boolean 
functions, which we name sumPI and maxPI. The quan- 
tity sum PI has been emerging through a line of research 
on quantum query complexity lower bounds via the so- 
called quantum adversary method fAmb 02\ \Amb03\ \BSS03\ 
^ha04 LMU^, culminating in [SS04J with the realization 
that these many different formulations are in fact equiva- 
lent. Given that sumPI turns out to be such a robust invari- 
ant of a function, we begin to investigate this quantity in its 
own right and see that it also has applications to classical 
complexity theory. 

As a surprising application we show that sumPI^(/) is a 
lower bound on the formula size, and even, up to a con- 
stant multiplicative factor, the probabilistic formula size 
of f. We show that several formula size lower bounds 
in the literature, specifically Khrapchenko and its exten- 
sions \Khr71\ \Kou93V . including a key lemma of HHds98V . 
are in fact special cases of our method. The second quan- 
tity we introduce, maxPI(/), is always at least as large as 
sumPI(/), and is derived from sumPI in such a way that 
maxPP(/) remains a lower bound on formula size. 

Our main result is proven via a combinatorial lemma 
which relates the square of the spectral norm of a matrix 
to the squares of the spectral norms of its submatrices. The 
generality of this lemma gives that our methods can also be 
used to lower bound the communication complexity of re- 
lations, and a related combinatorial quantity, the rectangle 
partition number 

To exhibit the strengths and weaknesses of our methods, 
we look at the sumPI and maxPI complexity of a few ex- 
amples, including the recursive majority of three function, 
a function defined by Ambainis HAmbOSV . and the collision 
problem. 



1 Introduction 

A central and longstanding open problem in complexity 
theory is to prove superlinear lower bounds for the circuit 
size of an explicit Boolean function. While this seems quite 
difficult, a modest amount of success has been achieved in 
the slightly weaker model of formula size, a formula being 
simply a circuit where every gate has fan-out at most one. 
The current best formula size lower bound for an explicit 
function is n^""^^) by Hastad lHas98l . 

In this paper we show that part of the rich theory de- 
veloped around proving lower bounds on quantum query 
complexity, namely the so-called quantum adversary argu- 
ment, can be brought to bear on formula size lower bounds. 
This adds to the growing list of examples of how studying 
quantum computing has led to new results in classical com- 
plexity, including |SV01 KW03 Aar04 LM04|, to cite just 
a few. 

The roots of the quantum adversary argument can be 
traced to the hybrid argument of |BBBV97|, who use it 
to show a f2(i/n) lower bound on quantum search. Am- 
bainis developed a more sophisticated adversary argument 
I Amb02 1 and later improved this method to the full-strength 
quantum adversary argument |Amb03|. Further general- 
izations include Barnum, Saks, and Szegedy |BSS03| with 
their spectral method and Zhang |Zha04| with his strong 
adversary method. Laplante and Magniez | LM04 1 use Kol- 
mogorov complexity to capture the adversary argument in 
terms of a minimization problem. This line of research cul- 
minates in recent work of Spalek and Szegedy |SS04| who 
show that in fact all the methods of IAmb03IIBSS03..Zha04l 
LM04 1 are equivalent. 

The fact that the quantum adversary argument has so 
many equivalent definitions indicates that it is a natu- 
ral combinatorial property of Boolean functions which is 
worthwhile to investigate on its own. We give this quan- 



tity its own name, sum PI, and ad opt the following primal 
formulation of the method, from ISS04l lLM041. Letting 
S C {0, 1}" and / : 5 ^ {0, 1}, be a Boolean function we 

say 

sumPI(/) = mill max — , (1) 

P f(.^)A(y) E ' \/Px{i)Pv{i) 

where p = {px : a; G S*} is a family of probabihty distribu- 
tions on the indices [n]. If (5e(/) is the two sided error quan- 
tum query complexity of / then Qdf) = i^(sumPI(/)). 
We show further that sumPP(/) is a lower bound on the 
formula size of /. Moreover, sumPP(/) generalizes sev- 
eral formula size lower bounds in the literature, specifically 
Khrapchenko and its extensions |,Khr71 ,.Kou93.l , and a key 
lemma of IIHas98l used on the way to proving the current 
best formula size lower bounds for an explicit function. 
We also introduce 



Kl(/)- 




where K is prefix-free Kolmogorov complexity. This for- 
mulation arises from the quantum and randomized lower 
bounds of |LM041. This formulation is especially inter- 
esting because of the intuition that it provides. For ex- 
ample, it allows for a very simple proof that circuit depth 
> Kl(/), using the Karchmer-Wigderson characteri- 
zation of circuit depth I.KW88.I . 

We define a quantity closely related to which we call 
maxPI. 

maxPI(/) = min max — (2) 

Notice that this is like sumPI but where the sum is re- 
placed by a maximum. By definition, maxPI is larger than 
sum PI, but its square is still a lower bound on formula 
size. However, maxPI is no longer a lower bound on quan- 
tum query complexity in general, and we give an exam- 
ple of a partial function / for which sumPI(/) = 2 and 
maxPI(/) = n/2. For this function, the colhsion prob- 
lem, maxPI(/) > Q,{f) = e(ni/3) I ASOI IBHTQTI . 

We look at several concrete problems to illustrate the 
strengths and weaknesses of our methods. We study the 
height h recursive majority of three problem, R— MAJg, 
and show that ^^(R-MAJg) = 17(2'') and a lower bound 
of 4'' for the formula size. We also look at a function de- 
fined by Ambainis | Amb031 to separate the quantum query 
complexity of a function from the bound given by the poly- 
nomial method [BBCj^OlJ. This function gives an exam- 
ple where sumPP can give something much better than 
Khraphchenko's bound. We also give bounds for the col- 
lision problem. 



1.1 Organization 

In SectionlJl we give the definitions, results, and notation 
that we use throughout the paper, and introduce the quanti- 
ties sumPI, maxPI, and Kl. In Section |3l we prove some 
properties of sumPI and maxPI. In Section|4] we show how 
sumPI and maxPI give rise to formula size lower bounds, 
for deterministic and probabilistic formula size. In Sec- 
tion|5] we compare our new methods with previous methods 
in formula size complexity. In Section|6l we investigate the 
limits of our and other formula lower bound methods. Fi- 
nally, in Section0we apply our techniques to some concrete 
problems. 

2 Preliminaries 

We use standard notation such as [n] — {!,..., n}, \S\ 
for the cardinality of a set S, and all logarithms are base 2. 
Hamming distance is written dn- 

2.1 Complexity measures of Boolean functions 

We use standard measures of Boolean functions, such 
as sensitivity and certificate complexity. We briefly re- 
call these here, see lBW02| for more details. For a set 
S C {0, 1}" and Boolean function / : S* ^ {0, 1}, the 
sensitivity of / on input x is the number of positions i £ [n] 
such that changing the value of x in position i changes the 
function value. The zero-sensitivity, written so(/) is the 
maximum over x G /^^(O) of the sensitivity of / on x. 
The one-sensitivity, si(/) is defined analogously. The max- 
imum of so{f), si{f) is the sensitivity of /, written s(/). 

A certificate for / on input x G 5* is a subset / C [n] 
such that for any y satisfying yi — Xi for all i e / it must be 
the case that f{y) = f{x). The zero-certificate complexity 
of /, written Co(/) is the maximum over all x G /^^(O) 
of the minimum size certificate of x. Similarly, the one- 
certificate complexity of /, written Ci(/) is the maximum 
over all a; G /^^(l) of the minimum size certificate of x. 

2.2 Linear algebra 

For a matrix A (respectively, vector v) we write 
(resp. v^) for the transpose of A, and A* (resp. v*) for 
the conjugate transpose of A. For two matrices A, B we 
let A o B be the Hadamard product of A and B, that is 
{AoB)[x,y] = A[x,y]B[x,y]. We write A > BifAis en- 
trywise greater than B, and Ay B when A — B is positive 
semidefinite, that is Vw : v'^{A- B)v > 0. We let rk(A) 
denote the rank of the matrix A. 

We will make extensive use of the spectral norm, denoted 
||A||2. For a matrix A, 

\\A\\2 ~ {\/A : A is the largest eigenvalue of A* A}. 



For a vector v, we let |?;| be the ^2 norm of v. 

We will also make use of the maximum abso- 
lute column sum norm, written ||v4||i and defined as 
||A||i — maxj ^ j] |, and the maximum absolute 
row sum norm, written ||A||oo and defined ||A|loo = 

We collect a few facts about the spectral norm. These 
can be found in l!HJ99l . 

Proposition 1 Let A be an arbitrary m by n matrix. Then 

t 11/111 

1. \\A\\2 = max„,^ 

2. \\A\\l<\\AUA\\^ 

3. For nonnegative matrices A, B, if A < B then \ \A\\2 < 
\\Bh 

2.3 Deterministic and probabilistic formulae 

A Boolean formula over the standard basis {V, A, ^} is 
a binary tree where each internal node is labeled with V or 
A, and each leaf is labeled with a literal, that is, a Boolean 
variable or its negation. The size of a formula is its number 
of leaves. We naturally identify a formula with the function 
it computes. 

Definition 2 Let f : {0, 1}" {0, 1} be a Boolean func- 
tion. The formula size of f, denoted L(/), is the size of the 
smallest formula which computes f. The formula depth of 
J, denoted A{f) is the minimum depth of a formula comput- 
ing f. 

It is clear that L(/) < 2'^*^-''-'; that in fact the opposite in- 
equality d(/) < 0(log L(/)) also holds is a nontrivial re- 
sult due to Spira |Spi71| . 

We will also consider probabilistic formulae, that is, a 
probability distribution over deterministic formulae. We 
take a worst-case notion of the size of a probabilistic for- 
mula. Probabilistic formula size has been studied before, 
for example in I Val84,Bop89, DZ97 Kla04l . 

Definitions Let {fj}ji£J be a set of functions with fj : 
S {0, 1} for each j e J. For a function f : S {0, 1}, 
we say that f is e-approximated by {fj}jeJ if there is a 
probability distribution a — {ajljgj over J such that for 
every x £ S, 

a 

In particular, i/maxj L(/j) < s, then we say that f is e- 
approximatedby formulas of size s, denoted L"^(/) < s. 

Note that even if a function depends on all its variables, 
it is possible that the probabilistic formula size is less than 
the number of variables. 



2.4 Communication complexity of relations 

Karchmer and Wigderson IKW88I give an elegant char- 
acterization of formula size in terms of a communication 
game. We will use this framework to present our lower 
bounds. This presentation has the advantage of showing 
that our methods work more generally for the communica- 
tion complexity of relations beyond the "special case" of 
formula size. The framework of communication complex- 
ity also allows us to work with the rectangle partition num- 
ber, C^{R), which is known to lower bound communica- 
tion complexity and arises very naturally when using our 
techniques. 

Let X, Y, Z be finite sets, and i? C XxYxZ. In the 
communication game for R, Alice is given some x E X, 
Bob is given some y E Y and their goal is to find some 
z G Z such that {x, y, z) G R, if such a z exists. A commu- 
nication protocol is a binary tree where each internal node 
V is labelled by a either a function ay : X ^ {0, 1} or 
by : Y {0, 1} describing either Alice's or Bob's mes- 
sage at that node, and where each leaf is labelled with 
an element z E Z. A communication protocol computes 
R if for all (a;,y) G XxY walking down the tree ac- 
cording to Gy, by leads to a leaf labelled with z such that 
{x,y, z) e R, provided such a z exists. The communica- 
tion cost D(i?) of R is the height of the smallest commu- 
nication protocol computing R. The communication matrix 
of R is the matrix Mfl[a;,j/] = {z : R{x,y,z)}. A rect- 
angle X'xY' with X' C X and y' C F is monochro- 
matic if C\xex\yeY' MrI^jV] 0- The protocol parti- 
tion number (R) is the number of leaves in the small- 
est communication protocol computing R, and the rectangle 
partition number (i?) is the smallest number of disjoint 
monochromatic rectangles required to cover XxY. (Note 
that C^(i?) < C^iR).) 

Definition 4 For any Boolean function f we associate a re- 
lation Rf = {{x, y, i) : f{x) = 0, f{y) = l,Xij^ yj. 

Theorem 5 (Karchmer- Wigderson) For any Boolean 
function f, L(/) = C^iRf) and d{f) = D(i?/). 

2.5 sumPI and the quantum adversary method 

Knowledge of quantum computing is not needed for 
reading this paper; for completeness, however, we briefly 
sketch the quantum query model. More background on 
quantum query complexity and quantum computing in gen- 
eral can be found in rB W02llNcboj . 

As with the classical counterpart, in the quantum query 
model we wish to compute some function f : S ^ {0, 1}, 
where S C S", and we access the input through queries. 
The complexity of / is the number of queries needed to 
compute /. Unlike the classical case, however, we can now 



make queries in superposition. Formally, a query O corre- 
sponds to the unitary transformation 

O : |i, 6, z) I— > |i, 5 Xi,z) 

where i G [n], 6 G {0, 1}, and z represents the workspace. 
A t-query quantum algorithm A has the form A = 
UtOUt-iO ■ ■ -OUiOUq, where the Uk are fixed unitary 
transformations independent of the input x. The compu- 
tation begins in the state |0), and the result of the com- 
putation A is the observation of the rightmost bit of A\0). 
We say that A e-approximates / if the observation of the 
rightmost bit of A\0) is equal to f{x) with probability at 
least 1 — e, for every x. We denote by Qe{f) the minimum 
query complexity of a quantum query algorithm which e- 
approximates /. 

Along with the polynomial method iBBC+Oll . one 
of the main techniques for showing lower bounds in 
quantum query complexity is the quantum adversary 
method |Amb02 Amb03 BSS03 Zha04 LM04|. Re- 
cently, Spalek and Szegedy |SS04| have shown that all the 
strong versions of the quantum adversary method are equiv- 
alent, and further that these methods can be nicely charac- 
terized as primal and dual. 

We give the primal characterization as our principal def- 
inition of sum PI. 

Definition 6 (sumPI) Let S C {0, 1}" and f : S {0, 1} 
be a Boolean function. For every x G S let px '■ [n] — > 
R be a probability distribution, that is, Px{i) > and 
'^iPx{i) = 1- Let p — {px : x G S}. We define the 
sum probability of indices to be 

sumPI(/) = min max 



, E 



We will also use two versions of the dual method, both a 
weight scheme and the spectral formulation. The most con- 
venient weight scheme for us is the "probability scheme", 
given in Lemma 4 of | LM04 1 . 

Definition 7 (Probability Sclieme) Let S C {0,1}" and 
f : S {0, 1} be a Boolean function, and X — 
/^^(0),y = /^^(l). Let q be a probability distribution 
on XxY, and pa,Pb be probability distributions on X, Y 
respectively. Finally let {p'^ i '■ x G X,i G [n]} and 
{p'y i '■ y (z Y,i Cz [n]} be families of probability distri- 
butions on X,Y respectively. Assume that q{x,y) = 
when f{x) — f{y). Let P range over all possible tuples 
{q^PA,PB, {p'x ilx.i) of distributions as above. Then 



PA(/) 



max 

p 



mm 



^PA{x)pB{y)p'x,i{y)p'y,i{x) 



<iix,y) 



We will also use the spectral adversary method. 

Definition 8 (Spectral Adversary) Let S C {0,1}" and 
f : S — !• {0,1} be a Boolean function. Let X — 
/-1(G), y = /"Hi)- + be an arbitrary \X\y.\Y\ 

non-negative symmetric matrix that satisfies V\x,y\ = 
whenever f{x) — f{y). For i G [n], let Ti be the matrix: 



^z[x,y] 



Then 



SA(/) 



ifxi = yi 

r[x,y] ifxi^yi 

liriu 



max 

r max, 



Note that the spectral adversary method was initially de- 
fined IBSS03I for symmetric matrices over X ^Y . The 
above definition is equivalent: if A is a X U F matrix sat- 

'x,y] ^ when f{x) = /(y) 
B 



isfying the constraint that A 
then A is of the form A = 



for some matrix 





B over XxY. Then the spectral norm of A is equal to that 
of B. Similarly, for any XxY matrix A we can form a 
symmetrized version of A as above preserving the spectral 
norm. 

We will often use the following theorem implicitly in 
taking the method most convenient for the particular bound 
we wish to demonstrate. 

Theorem 9 (Spalek-Szegedy) Let n > 1 be an integer, 
S C {0, 1}" andf:S~* {0, 1}. Then 

sumPI(/) = SA(/) = PA(/) 

2.6 The Kl and maxPI complexity measures 

The definition of Kl arises from the Kolmogorov com- 
plexity adversary method ILM04I . The Kolmogorov com- 
plexity Cu{x) of a string x, with respect to a universal 
Turing machine U is the length of the shortest program p 
such that U{p) = x. The complexity of x given y, de- 
noted C{x\y) is the length of the shortest program p such 
that U{{p,y)) = x. When U is such that the set of out- 
puts is prefix-free (no string in the set is prefix of another in 
the set), we write Ku{x\y). From this point onwards, we 
fix U and simply write K{x\y). For more background on 
Kolmogorov complexity consult ILV97I . 

Definition 10 Let S C S" for an alphabet E. For any 
function f : S ^ {0, 1}, let 

Kl(/) = min max min K(i\x,a) + K{i\y,a). 

The advantage of using concepts based on Kolmogorov 
complexity is that they often naturally capture the informa- 
tion theoretic content of lower bounds. As an example of 
this, we give a simple proof that Kl is a lower bound on 
circuit depth. 



Theorem 11 For any Boolean function f, Kl(/) < d(/). 

Proof: Let P be a protocol for Rf. Fix x, y with different 
values under /, and let Ta be a transcript of the messages 
sent from A to B, on input x, y. Similarly, let Tb be a tran- 
script of the messages sent from B to A. Let i be the output 
of the protocol, with Xi ^ yi- To print i given x, simulate 
P using X and Tb- To print i given y, simulate P using y 
and Ta. This shows that Vx,i/ : f{x) ^ f{y),3i : Xi ^ 
y^,K{i\x,a)+K{i\y,a) < \Ta\ + \Tb\ < D(i?/), where 
a is a description of A's and B's algorithms. □ 

Remark A similar proof in fact shows that Kl(/) < 
2N(i?/), where N is the nondeterministic communication 
complexity. Since the bound does not take advantage of in- 
teraction between the two players, in many cases we cannot 
hope to get optimal lower bounds using these techniques. 

An argument similar to that in llSS04l shows that 



2KI(/) ^ 



1 



mm max 



\/ Px{i)Py{i) ^ 



Notice that the right hand side of the equation is identical to 
the definition of sum PI, except that the sum in the denom- 
inator is replaced by a maximum. This led us to define the 
complexity measure maxPI, in order to get stronger formula 
size lower bounds. 

Definition 12 (maxPI) Let / : S" — > {0, 1} be a function 
with S C S". For every x G S let '■ [n] M. be a 
probability distribution. Let p — {px '■ x € S}. We define 
the maximum probability of indices to be 



maxPI(/) = min max 



1 



\/Px{i)Py{i) 



It can be easily seen from the definitions that sumPI(/) < 
maxPI(/) for any /. The following lemma is also straight- 
forward from the definitions: 

Lemma 13 T/'S" C 5 and f : S' ^ {0, 1} is a domain 
restriction of f : S ^ {0,1} to S', then sumPI(/') < 
sumPI(/) andmaxP\{f) < maxPI(/). 

3 Properties of sum PI and maxPI 
3.1 Properties of sumPI 

Although in general, as we shall see, sum PI gives weaker 
formula size lower bounds than maxPI, the measure sumPI 
has several nice properties which make it more convenient 
to use in practice. 

The next lemma shows that sumPI behaves like most 
other complexity measures with respect to composition of 
functions: 



Lemma 14 Let gi, . . . , gn be Boolean functions, and h be 
a function, h : {0,1}" {0,1}- If sumP\{gj) < a for 
1 < J < <^nd sumPI(/i) < b, then for f — h{gi, . . . , gn), 
sumPI(/) < ab. 

Proof: Let p be an optimal family of distribution functions 
associated with h and pj be optimal families of distribution 
functions associated with gj. Define the distribution func- 
tion 

Qxii) = Pg{x){i)P]Ai)- 

Assume that fox x,y G S we have f{x) ^ f{y)- It is 
enough to show that 



H / Pg(v)U)P3,vi^ 

i- Xi^Vi y je[n] y jg[n] 



> 



1 

ab 



(3) 



By Cauchy-Schwarz, the left hand side of Eq. |3] is 
greater than or equal to 



H J2 yP9{x)ij)PjAi)yPg{y)ij)Pj,vii) 



\JPg(x){j)Pg{y){j) ^ \J PoA^)Pj-y'S) 
i-Xi^Vi 




(4) 



As long as gj{x) ^ 9j{y), by the definition of pj, we 
have J2i.x.^y. \/P],x{'>)\/Pj.y{'>') > V«- Thus we can es- 
timate the expression in Eq.0]from below by: 



E 



/ Pg{x)U)Pg{v){j) 

j-gj(.x)=^gj{y) 

By the definition of p we can estimate the sum (without 
the 1 /a coefficient) in the above expression from below by 
1 /b, which finishes the proof. □ 

Another advantage of working with sum PI complexity is 
the following very powerful lemma of Ambainis LAmbOSl 
which makes it easy to lower bound the sumPI complexity 
of iterated functions. 

Definition 15 Let f : {0, 1}" {0, 1} be any Boolean 
function. We define the dth iteration of f, written : 
{0, 1}" {0, 1}, inductively as f^{x) = f{x) and 

f ^ C-^) ~ /(/ (^1? • ■ ■ I ^n"*)! / (^^n^' + l: ■ ■ • I 2;2„£i), . . . , 



Lemma 16 (Ambainis) Let f be any Boolean function and 
the dth iteration off. Then sumPI(/'*) > (sumPI(/))''. 

Combining this with LemmaO we get: 

Corollary 17 Let f be any Boolean function and the dth 
iteration of f . Then sumP\{f'^) = (sumPI(/))''. 

Lemmas and together with the adversary argu- 
ment lower bound for the Grover search |Gro96 Amb021 
imply that for total Boolean functions, the square root of 
the block sensitivity is a lower bou nd on the sumPI com- 
plexity IAmb02l . Hence, by INS94llBBC+"on : 

Lemma 18 (Ambainis) For total Boolean functions the 
sumPI complexity is in polynomial relation with the various 
(deterministic, randomized, quantum) decision tree com- 
plexities and the Fourier degree of the function. 

3.2 Properties of maxPI 

One thing that makes sumPI so convenient to use is that 
it dualizes ISSO-^. In this section we partially dualize the 
expression maxPI. The final expression remains a mini- 
mization problem, but we minimize over discrete index se- 
lection functions, instead of families of probability distri- 
butions, which makes it much more tractable. Still, we re- 
mark that maxPI can take exponential time (in the size of 
the truth table of /) whereas, sum PI takes polynomial time 
in the size of the truth table of / to compute by reduction to 
semidefinite programming. 

Definition 19 (Index selection functions) Let 

f : {0, 1}" {0,1} be a Boolean function, X=f-^{0), 
andY=f^^(l). Fori G [n] let be |X|x|y| be defined 
by Di[x,y\ = 1 — ^xi.yi- We call the set of n Boolean 
fO — Ij matrices {Pi}ig„ index selection functions // 

7. Pi — E, where E[x, y] ^ 1 for every x (z X, 
y €i Y. ( informally: for every x Cz X, y Cz Y we select 
a unique index) 

2. Pi < Di (informally: for every x X, y d Y the 
index we select is an i such that Xi ^ yi). 

Notice that index selection functions correspond to par- 
titioning XxY,m such a way that if x,y are in the ith part, 
then Xiy^yi. 

Theorem 20 (Spectral adversary version of maxPI) Let 

/, X, Y be as in the previous definition. Let A be an arbi- 
trary |X|x|y| nonnegative matrix satisfying A[x,y\ = 
whenever f{x) — f{y). Then 

maxPI / = mm max ;-— ;— , 

^ {P.}. A maxi 11^0/^,112 ' 

where {Pi}i runs through all index selection functions. 



Proof: For a fixed family of probability distributions p = 
{px}, and for the expression 



1 



max 



(5) 



let us define the index selection function Pi[x,y] = 1 if 
i = argmaxj.^._^j^. \/px{i)Py{i) and otherwise. (Argmax 
is the smallest argument for which the expression attains its 
maximal value.) Then the denominator in Eq. |5] becomes 

equal to Y^txt^yt \/P^(^)Pv('^)Pi\-^^y^- replace the 

above system of P^s with any other choice of index selection 
function the value of \/WJ)thJJ)Pi[x,y] will not 

increase. Thus we can rewrite Eq.|5]as 



where here Pi[x, y] runs through all index selection func- 
tions. Thus: 

maxPI(/) = 



1 / max min max > 



' i-.Xi^yi 



\Jpx{i)Py{i)Pi[x,y]- (6) 



Notice that in Eq.|6lthe minimum is interchangeable with 
the second maximum. The reason for this is that for a 
fixed p there is a fixed {Pi[x, y\}i system that maximizes 

Y.i:x,^y, \/Px{i)Py{i)P'Ax,y] for all x,y : f{x) 7^ f{y). 
Thus: 

maxPI(/) = 



1 / max 
/ {P. 



IX max min ^ ^ p^{i)py{i)Pi[x,y] 



Following the proof of the main theorem of Spalek and 
Szegedy we can create the semidefinite version of the above 
expression. The difference here, however, is that we have to 
treat {Pi}i (the index selection functions) as a "parameter" 
of the semidefinite system over which we have to maximize. 
Unfortunately it also appears in the final expression. 

Semidefinite version of maxPI: For fixed {Pi}i let /i^jj^x 
be the solution of the following semidefinite program: 

maximize /i' 

subject to (yi) ^ 0, 

R^oP^> l^'F. 

Define /imax as the maximum of ij!^^^, where Pi{l < i < 
n) run through all index selection functions. Then maxPI = 

l/p 

max- 

We can dualize the above program and simplify it in 
same way as was done in Spalek and Szegedy for the case 



of sumPI with the only change that needs to be replaced 
with Pi, and that we have to minimize over all choices of 

{P^}^. □ 



4 Formula size lower bounds 



Karchmer and Wigderson IIKW88I give an elegant char- 
acterization of formula size in terms of a communication 
game. We will use this framework to present our lower 
bounds. This presentation has the advantage of showing 
that our methods work more generally for the communica- 
tion complexity of relations beyond the "special case" of 
formula size. The framework of communication complex- 
ity also allows us to work with a combinatorial quantity, 
the rectangle partition number, C^{R), which is known 
to lower bound communication complexity and arises very 
naturally when using sum PI. 

4.1 Key combinatorial lemma 

We first prove a combinatorial lemma which is the key 
to our main result. This lemma relates the spectral norm 
squared of a matrix to the spectral norm squared of its sub- 
matrices. This lemma may also be of independent interest. 

Let X and Y be finite sets. A set system S (over X xY) 
will be called a covering if Uses S — XxY. Further, S will 
be called a partition if 5 is a covering and the intersection 
of any two distinct sets from S is empty. A rectangle (over 
XxY) is an arbitrary subset of X x F of the form Xo x Yn 
for some Xo C X and Yo C Y. A set system TZ will be 
called a rectangle partition if 7?. is a partition and each R e 
TZisa rectangle. Let yl be a matrix with rows indexed from 
X and columns indexed from Y and let 7?. be a rectangle 
partition of XxY. For a rectangle R — XqxYq G TZ Let 
Afi be the |Xo| x |y"o| submatrix of A corresponding to the 
rectangle R. For subsets S C XxY we define: 

As[x, y] — A[x, y], if {x, y) & S and otherwise. (7) 

Notice that for a rectangle R, matrices A^i and Aji differ 
only by a set of all-zero rows and columns. We are now 
ready to state the lemma: 

Lemma 21 Let A be an arbitrary \X\x\Y\ matrix (possi- 
bly with complex entries), and TZ a partition of X x Y. Then 

\\A\\l<i:Hen\\M\l 

Proof: By Proposition ||^||2 = max„^i, where 
the maximum is taken over all unit vectors u, v. Let u, v be 
the unit vectors realizing this maximimum. Then we have 



\u*Av\ 




u*Arv 

Ren 



Let ujj be the portion of u* corresponding to the rows of R, 
and vji be the portion of v corresponding to the columns of 
R. Notice that {wfljijgK do not in general form a partition 
of u. We now have 



l^ll 



Ren 

< Y \\ARh\uR\\vR\ 
Ren 



< \'^*rAr^r\ 

Ren 



by Proposition [2 Applying the Cauchy-Schwarz inequal- 
ity, we obtain 



XRen / XRen 



1/2 



Now it simply remains to observe that 

EM'Kf^E E u[xYv[yY^\u\M' = l, 
Ren Ren(x,y)eR 

as 7?. is a partition of XxY . □ 

4.2 Deterministic formulae 

In this section, we prove our main result that maxPI is a 
lower bound on formula size. We first identify two natural 
properties which are sufficient for a function to be a formula 
size lower bound. 

Definition 22 A function ^ : 2^^^^ is called a rect- 

angle measure if the following properties hold. 

1. (Subadditivity) For any rectangle partition TZ of 
XxY,y^(XxY)<Y.^^^ti{R). 

2. (Monotonicity) For any rectangle R C XxY, and 
subset S QXxY, if R C S then ^(R) < fj.{S). 

Theoreml^implies that for any |X| x |F| matrix A with 
non-negative entries S \\As\ \ of expression is a rect- 
angle measure. Other examples include the rank of As for 
any matrix A over any field (see Section 15 .4> . and the fx- 
rectangle size bounds of | KKN95 1. 

Let Si , ^2 be two families of sets over the same universe. 
We say that Si is embedded in S2 (Si -< ^2) if for every 
S eSi there is a 5' G ^2 such that S C S". 

Theorem 23 Let ji be a rectangle measure over 2^^^, S 
be a covering of X x Y and TZ a rectangle partition ofX x Y 



such thatTZ -< S. Then \TZ\ > 



maxses p.{S) ' 

The proof follows by subadditivity and monotonicity of /i. 



Theorem 24 (Main Theorem) 

sumP|2(/) < maxP|2(/) < < L(/) 



Define Px{i) — '^jPj ■^i'^)- Let x,y be such that 

/(^) ^ fiy)- 



Jpx{i)Py(i) 



Proof: We have seen that sumPI^(/) < maxPP(/), and 
C^{Rf) < L(/) follows from the Karchmer-Wigderson 
communication game characterization of formula size, thus 
we focus on the inequality maxPP(/) < C'^{Rf). 

Let 7?. be a monochromatic rectangle partition of Rf 
such that \TZ\ = C^{Rf), and let A be an arbitrary 
\X\ X \Y\ matrix with nonnegative real entries. For R G TZ 
let color(i?) be the least index c such that Xc 7^ yc holds for 
all {x,y) G R. By assumption each R is monochromatic, 
thus such a color exists. Define 

= U color(/?,)=c^- 

Then TZ is naturally embedded in the covering {Sc}celn]- 
For any S C XxY, let fiA{S) = ||is||i. By Lemma 1271 
and item 3 of Proposition ^ is a rectangle measure. 
Hence by Theoreml23l 



max 
^4 max, 



\A 



We have exhibited a particular index selection function, the 
{Sc}c, for which this inequality holds, thus it also holds for 
maxPP(/) which is the minimum over all index selection 
functions. □ 



4.3 Probabilistic Formulae 



J2 JY'^oPjA^) J2oijPjA^)) 



- H \l'^lP3AT')\Joi3P3Ai) 



> 



E 



I 



'J E yPiAT')Pj,y{i 



where for the third step we have used the Cauchy-Schwarz 
Inequality. □ 

This lemma immediately shows that the sumPI method 
can give lower bounds on probabilistic formula size. 

Theorem 26 Let S C {0, 1}" and f : S -> {0, 1}. Then 
L'(/) > ((1 - 2e)sumP\{f)f for any e < 1/2. 

Proof: Suppose that {fj}jeJ gives an e-approximation to 
/. Using Lemma|25lin the contrapositive implies that there 
exists some j £ J with sumPI(/j) > (1 — 2e)sumPI(/). 
Theorem l24l then implies L(/,) > ((1 - 2e)sumPI(/))^ 
which gives the statement of the theorem. □ 



The properties of sum PI allow us to show that it can be 
used to lower bound the probabilistic formula size. 

Lemma 25 Let e < 1/2. If f : S ^ {0,1} is e- 
approximated by functions {fj}jeJ ^ith sumPI(/j) < s 
for every j g J, then sumPI(/) < s/(l — 2e). 

Proof: By assumption there is a probability distribution 
a = {ajljgj such that Pr[/(x) = fj{x)] > 1 — e. Thus 
for a fixed x € S, letting = {j £ J : f[x) = fj {x)}, we 
have J2jeJ '^j — ^ ~ ^- Hence for any x,y & S we have 
J2je J nJ ckj > 1 ~ 2e. For convenience, we write J^^y for 
sumPI(/j) < s there is a family of probability 
distributions pj such that whenever fj (x) ^ fj (y) 



5 Comparison among methods 

In this section we look at several formula size lower 
bound techniques and see how they compare with our meth- 
ods. A bottleneck in formula size lower bounds seems to 
have been to go beyond methods which only consider pairs 
{x,y) with f{x) ^ f{y) which have Hamming distance 
1 . In fact, the methods of Khrapchenko, Koutsoupias, and 
a lemma of Hastad can all be seen as special cases of the 
sumPI method where only pairs of Hamming distance 1 are 
considered. 

5.1 Khrapchenko's method 



One of the oldest and most general techniques available 
for showing formula size lower bounds is Khrapchenko's 



method IKhr71l . originally used to give a tight O(n^) lower 
bound for the parity function. This method considers a bi- 
partite graph whose left vertices are the 0-inputs to / and 
whose right vertices are the 1 -inputs. The bound given is 
the product of the average degree of the right and left hand 
sides. 

Theorem 27 (Khrapchenko) Let S C {0,1}" and f : 
S -> {0, 1}. Let A C /-i(0) and B C f-^{l). Let C 
be the set of pairs {x,y) g AxB with Hamming distance 
1, that is C = {{x,y) € AxB : dH{x,y) = 1}. Then 
L(/)>sumPI(/)2>^. 

Khrapchenko's method can easily be seen as a special 
case of the probability scheme. Letting A, B, C be as in the 
statement of the theorem, we set up our probability distri- 
butions as follows: 

• for all x^A, pa{x)=Q otherwise 

• pb{x)—1/\B\ for all x&B, pb{x)=0 otherwise 

• q{x, y)—l/\C\ for all {x, y)&C, q{x, y)=0 otherwise 

• Px.i{y)—^ if {^1 v)&C and Xi ^ yi, otherwise. Note 
that this is a probability distribution as for every x there 
is only one y such that {x,y)EC and Xi ^ yt- 

By Theorem|9]and Theoreml24l 

PA{x)pBiy)p'^Ay)PyA^) _ |cp 



L(/) > min 



/(^)^/(y), 



q{x,y) 



\A\\B\ 



where the expression in the middle is a lower bound on 
sumPI(/)2. 

5.2 The Koutsoupias bound 



Koutsoupias IKou93l extends Khrapchenko's method 
with a spectral version. The weights are always 1 for pairs 
of inputs with different function values that have Hamming 
distance 1, and everywhere else. 

Theorem 28 (Koutsoupias) Let f : {0, 1}" {0, 1}, and 

let A C f-^{Q), and B C f-^{l). Let C = {{x,y) G 
AxB : dH{x,y) — 1}. Let Q be a \B\x\A\ matrix 
Q[x,y\ = C{x,y) where C is identified with its charac- 
teristic function. Then L(/) > sumPI(/)^ > \\Q\\2- 

Proof: The bound follows easily from the the spectral ver- 
sion of sumPI. Let Q be as in the statement of the theorem. 
Notice that since we only consider pairs with Hamming dis- 
tance 1, for every row and column of Qi there is at most one 
nonzero entry, which is at most 1 . Thus by Proposition[2we 
have \\Qi\\2 < ||Q||i||Q||oo < 1- The theorem now follows 
from TheoremllH □ 



5.3 Hastad's method 

The shrinkage exponent of Boolean formulae is the least 
upper bound 7 such that subject to a random restriction 
where each variable is left free with probability p, Boolean 
formulae shrink from size L to expected size p'^L. De- 
termining the shrinkage exponent is important as Andreev 
IIAnd87l defined a function / whose formula size is L(/) 

Hastad llKs981 shows the shrinkage exponent of 
Boolean formulae is 2 and thereby obtains an formula 
size lower bound (up to logarithmic factors), the largest 
bound known for an explicit function. On the way to this 
result, Hastad proves an intermediate lemma which gives a 
lower bound on formula size that depends on the probabil- 
ity that restrictions of a certain form occur He proves that 
this lemma is a generalization of Khrapchenko's method; 
we prove that Hastad's lemma is in turn a special case of 
sumPI. Since Hastad's method uses random restrictions, 
which at first glance seems completely different from ad- 
versary methods, it comes as a surprise that it is in fact a 
special case of our techniques. 

Definition 29 For any function f : {0, 1}" {0, 1}, 

L A restriction is a string in {0, 1, *}" where -k means the 
variable is left free, and or 1 mean the variable is set 
to the constant or 1, respectively. 

2. The restricted function f\p is the function that remains 
after the non-k variables in p are fixed. 

3. Rp is the distribution on random restrictions to the 
variables of f obtained by setting each variable, in- 
dependently, to -k with probability p, and to or 1 each 
with probability 

4. A filter A is a set of restrictions which has the property 
that if p € A, then every p' obtained by fixing one of 
the -ks to a constant is also in A. 

5. When p is known from the context, and for any event 
E, and any filter A, we write Pr[£'|A] to mean 
FipeBjElpeA]. 

Theorem 30 (Hastad, Lemma 4.1) Lef / : {0,1}" ^ 
{0, 1}. Let A be the event that a random restriction in Rp 
reduces f to the constant 0, B be the event that a random 
restriction in Rp reduces f to the constant 1, and let C be 
the event that a random restriction p € Rp is such that f\p 
is a single literal. Then 



L(/)> 



Pr[A\A]Pr[B\A] \ 2p 



1 



P 



Proof: We show that the theorem follows from the proba- 
bility scheme (Definition^}. In this proof we only consider 
restrictions obtained from R„ that are in the filter A. We 



also abuse notation and use A and B to mean the sets of re- 
strictions in A which contribute with non-zero probability 
to the events A and B respectively. 

Implicit in Hastad's proof is the following relation be- 
tween restrictions in A and B. For every p G C, f\p reduces 
to a single literal, that is, for every p G C, there is an i such 
that f\p — Xi (or if the variable is negated). Define p^ 
to be p where Xi is set to h, for h £ {0, 1} (set Xi to 1—6 
if the variable is negated). To fit into the framework of the 
probability scheme, let p^ be p'' where all remaining *s are 
set to 1. This doesn't change the value of the function, be- 
cause it is already constant on p^. Then we say that pO, p^ 
are in the relation. 

We set pA{cr) = pTpjsj for any a e A, and pb{t) = 
Prj^yA] ™^ T £ B, and for every pair p'^.p^ in the 
relation, where p £ C, f\p — Xi or ^Xi, let 



Pr[p] 
Pr[C\A] 



The probabilities are on all other inputs. We can easily 
verify that the probabilities sum to 1. Forp', notice that the 
Hamming distance between p^ and is 1, so when p'' and 
i are fixed, there is only a single p^^'' with probability 1. 
By Theorem|9]and Theorem l24l 



L(/) > 



PAix)pB{y)PyAx)p'^^,{y) 
q{x,yY 
Pr[p^] Pr[pi] 



/Pr[C|A] 



Pr[A\/\] Pr[P|A] V Pr[p\ 



Finally, notice that Pr[p\ = Y^Pr[p^\ 



□ 



Remark Hastad actually defines f\p to be the result of re- 
ducing the formula for / (not the function) by applying a 
sequence of reduction rules, for each restricted variable. So 
there is a subtlety here about whether f\p denotes the re- 
duced formula, or the reduced function, and the probabili- 
ties might be different if we are in one setting or the other. 
However both in his proof and ours, the only thing that is 
used about the reduction is that if the formula or function 
reduces to a single literal, then fixing this literal to or to 
1 reduces the function to a constant. Therefore, both proofs 
go through for both settings. 

5.4 Razborov's method 



Razborov IIRaz90l proposes a formula size lower bound 
technique using matrix rank: 



Theorem 31 (Razborov) Let S be a covering over XxY, 
let A be an arbitrary nonzero \X\x\Y\ matrix, and TZ be a 
rectangle partition of XxY such that TZ ^ S. Then 



Tk(A) 

max ; — 

-4 maxs(zsrk{As) 



< aiS). 



It can be easily verified that the function S — + r'k{As) is 
a rectangle measure, thus this theorem follows from Theo- 
rem|23] Razborov uses Theoreml^to show superpolyno- 
mial monotone formula size lower bounds, but also shows 
that the method becomes trivial (Umited to 0{n) bounds) 
for regular formula size IRaz92l . An interesting differ- 
ence between matrix rank and and spectral norm is that 
rk(A + B) < ik{A) + rk(P) holds for any two matrices 
A, B, while a necessary condition for subadditivity of the 
spectral norm squared is that A, B be disjoint rectangles. 

6 Limitations 

6.1 Hamming distance 1 techniques 

We show that the bounds for a function / given by 
Khrapchenko's and Koutsoupias' method, and by Hastad's 
lemma are upper bounded by the product of the zero sen- 
sitivity and the one sensitivity of /. We will later use this 
bound to show a function on n bits for which the best lower 
bound given by these methods is n and for which an n^ "^^ 
bound is provable by sumPI^. 

Lemma 32 The bound given by the Khrapchenko method 
(Theorem \27l . Koutsoupias' method (Theorem \28\ . and 
Hastad's Lemma (Theorem \30[ for a function J are at most 

Proof: Let A be a nonnegative matrix, with nonzero entries 
only in positions {x, y) where f{x) — 0, /(y) = 1 and the 
Hamming distance between x, y is one. We first show that 



l^lli 



max 

A max," A 



2 

4|l2 



(8) 



Let ajnax be the largest entry in A. A can have at most 
so(/) many nonzero entries in any row, and at most si(/) 
many nonzero entries in any column, thus by item 2 of Pro- 
postion[2 

\\A\\l<\\AUA\\oo<al,^s^{f)s,{J). 



On the other hand, for some i, the entry amax appears in Ai, 
and so by item 1 of Proposition ^ 1 1 II 2 ^ '^'max- Equa- 
tionOfollows. 

Now we see that the left hand side of Equation [S] is 
larger than the three methods in the statement of the the- 
orem. That it is more general than Koutsoupias method 



is clear. To see that it is more general than the probabil- 
ity schemes method where q{x, y) is only positive if the 
Hamming distance between x,y is one: given the proba- 
bility distributions q,px,PY, define the matrix A[a;,?/] = 
q{x, y) / y/ px {x)py {y) ■ By item 1 of Proposition [0 
\\A\\2 > 1, witnessed by the unit vectors u[x] = \/px(x) 
and v[y] — \/pY{y)- As each reduced matrix Ai has at 
most one nonzero entry in each row and column, by item 2 
of Proposition[3we have 



Formula size 



Thus we have shown 



< max 



q'^jx.y) 

'i7f pxix)pY{y)' 



max ■ 
A max, 



I A; 



> max min 

px.PY.q x.y 



px{x)pY{y) 

q^ix.y) 



□ 



The only reference to the limitations of these methods 
we are aware of is Schiirfeld fSch83|, who shows that 
Khrapchenko's method cannot prove bounds greater than 

Co(/)Ci(/). 

6.2 Limitations of sumPI and maxPI 



Randomized 
formula size 




(Unweighted 

Ambainis)^ Koutsoupias 



Khrapchenko 



Hastad 



Figure 1. Summary of the methods and their limita- 
tions. The containments denoted by solid lines hold 
for total as well as partial functions. All containments 
are strict. 



The hmitations of the adver sary m ethod are well known 
IAmb02IILM04llijze03ll^ianilsSO"4 1 . Spalek and Szegedy, 
in unifying the adversary methods, also give the most ele- 
gant proof of their collective limitation. The same proof 
also shows the same limitations hold for the maxPI mea- 
sure. 

Lemma 33 Let f : {0,1}" {0,1} be any partial 
or total Boolean function. If f is total (respectively, 
partial) then maxPI(/) < \/Co{f)Ci(J) (respectively, 

Proof: Assume that / is total. Take x, y such that f{x) = 
and f{y) = 1. We choose any 0-certificate Bq for x and any 
1 -certificate Bi for y and let Px{i) = l/Co(/) for all i G 
Bo and py{i) — 1/Ci(/) for all i E Bi. As / is total, we 
have BqHBi ^ 0, thus let j e BqCiBi. For this j we have 
PxU)Py{j) > 1/ (Co(/)Ci(/)), thus min, l/px{i)py{i) > 
Coif)Ciif). 

The case where / is partial follows similarly. As we no 
longerknowthat BoHi?! 7^ 0, we put a uniform distribution 
over a 0-certificate of x and the uniform distribution over [n] 
on y or vice versa. □ 

This lemma implies that sumPI and maxPI are polyno- 
mially related for total /. 

Corollary 34 Let f be a total Boolean function. Then 
maxPI(/) < sumPI'*(/). 



Proof: By IAmb02l Thm. 5.2] we know that ^Jbs{f) < 
sumPI(/). As / is total, by the above lemma we know that 
maxPI(/) < Co(/)Ci(/). This in turn is smaller than 
6s(/)2 as C(/) < s{f)bslf) IJ^is91J . The statement fol- 
lows. □ 



Besides the certificate complexity barrier, another seri- 
ous limitation of the sumPI method occurs for partial func- 
tions where every positive input is far in Hamming distance 
from every negative input. Thus for example, if for any pair 
X, y where f{x) — 1 and f{y) = we have dnix, y) > en, 
then by putting the uniform distribution over all input bits 
it follows that sumPI(/) < 1/e. The measure maxPI does 
not face this limitation as there we still only have one term 
in the denominator. 

Following this line of thinking, we can give an exam- 
ple of a partial function / where maxPI(/) ^ sumPI(/). 
Such an example is the Collision problem (see Section lTJt . 
as here any positive and negative input must differ on at 
least n/2 positions. Another family of examples comes 
from property testing, where the promise is that the input 
either has some property, or that it is e-far from having the 
property. 



7 Concrete lower bounds 

The quantum adversary argument has been used to prove 
lower bounds for a variety of problems. Naturally, all of 
these lower bounds carry over to formula size lower bounds. 
In this section we present some new lower bounds, in or- 
der to highlight the strengths and weaknesses of maxPI and 
sumPI. 

7.1 Recursive majorities 

As an example of applying sum PI, we look at the re- 
cursive majority of three function. We let R— MAJg : 
{0,1}"^' {0,1} be the function computed by a complete 
ternary tree of depth h where every internal node is labeled 
by a majority gate and the input is given at the leaves. 

Recursive majority of three has been studied before in 
various contexts. It is a monotone function which is very 
sensitive to noise IMO03I . making it useful for hardness 
amplification in NP IO'D02I . Jayram, Kumar, and Sivaku- 
mar fjKS03l give nontrivial lower and upper bounds on the 
randomized decision tree complexity of recursive majority 
of three. They show a lower bound of (7/3)'* on the ran- 
domized decision tree complexity. As far as we know, the 
quantum query complexity of recursive majority of three 
has not yet been investigated. We show a lower bound of 2^ 
on the quantum query complexity. 

Lemma 35 sumPKR-MAjJ) = maxPI(R-MAj5) = 2'' 

Proof: To see that maxPI(R-MAJ3) < 2'', observe that 
Co(R-MAjJj) = Ci(R-MAj(j) ^ 2^. The result then 
follows from Lemmal33l 

We now turn to the lower bound. We will first show 
a lower bound for R— MAJ3, the majority of three func- 
tion, and then apply Lemma ^] Consider the following 
table, where the rows are indexed by negative instances x, 
the columns by positive instances y, and I's indicate when 
dH{x,y) = 1. 





110 


101 


oil 


001 





1 


1 


010 


1 





1 


100 


1 


1 






Interpreting this table as the adjacency matrix of a graph, it 
is clear that every vertex has degree 2. Thus Khrapchenko's 
method gives a bound of 4 for the base function. By The- 
orem 1271 we have sumPI(R— MAJg) > 2. Now applying 
Lemma[T6l gives the lemma. □ 

From Lemmal35lwe immediately obtain quantum query 
complexity and formula size lower bounds: 



Theorem 36 Let R— MAJ3 be the recursive majority of 
three function of height h. Then (5e(R— MAJg) > (1 — 
2^e(l ~e))2^ and L^(R-MAJ3) > (1 - 2e)4''. 

The best upper bound on the formula size of R— MAJ3 
is 5''. For this bound, we will use the following simple 
proposition about the formula size of iterated functions. 

Proposition 37 Let S C {0,1}" and f : S^{0,1}. If 
L(/) < s then L(/'^) < s"^, where /^^ is the d*'' iteration of 
/• 

Proposition 38 L(R-MAJ3) < 5''. 

Proof: The formula {xi A X2) V {{xi V 2:2) A X3) com- 
putes R— MAJ3 and has 5 leaves. Using Proposition 1371 
gives L(R-MAJ3) < 5''. □ 



7.2 Ambainis' function 

We define a function Ia : {0, l}'^ {0, 1} after Am- 
bainis IAmb03l . This function evaluates to 1 on the fol- 
lowing values: 0000, 0001, 001 1, 01 1 1, 1 1 1 1, 1 1 10, 1 100, 
1000. That is, f{x) = 1 when xi < X2 < X3 < X4 or 
xi > X2 > X3 > X4. To obtain this formulation from Am- 
bainis' original definition, exchange xi and x^, and take the 
negation of the resulting function. There are a few things 
to notice about this function. The sensitivity of Ia is 2 on 
every input. Also on an input x — X1X2X3X4 the value 
of Ia {x) changes if both bits sensitive to x are flipped si- 
multaneously, and if both bits insensitive for x are flipped 
simultaneously. 

We will be looking at iterations of the base function fA 
as in Definition 1 151 Notice that the sensitivity of fA** is 2'^ 
on every input x E {0, 1}^ . 

Lemma 39 sumPI(fA'') = 2.5''. 

Proof: Ambainis has already shown that sumPI(fA'') > 
2.5'^ IAmb03l. 

We now show the upper bound. We will show an upper 
bound for the base function Ia and then use the composition 
Lemma [2] Every input x 1X2X3X4 has two sensitive vari- 
ables and two insensitive variables. For any x G {0, 1}^ we 
set Pxii) — 2/5 if i is sensitive for x and Pxi^) = 1/10 if 
i is insensitive for x. The claim follows from the following 
observation: for any x,y £ {0, 1}"' such that f{x) 7^ /(y) 
at least one of the following holds 

• X and y differ on a position i which is sensitive for both 
X and y. Thus \/Px{i)Py{i) > 2/5 

• X and y differ on at least 2 positions, each of these 
positions being sensitive for at least one of x, y. Thus 

> 271725 -2/5 □ 



This lemma gives us a bound of 6.25'^ « JV^ '^^ on the 
formula size of fA**. Since the sensitivity of fA^* is 2'', 
by Lemma 1^ the best bound provable by Khrapchenko's 
method, Koutsoupias' method, and Hastad's lemma is 4^^ = 
N. 

It is natural to ask how tight this formula size bound is. 
The best upper bound we can show on the formula size of 
fA'' is lO'^. 

Proposition 40 L(fA'') < lO'^ 

Proof: It can be easily verified that the following formula 
of size 10 computes the base function fA: 

(^xi V a;3 V ^a;4)A 

((^xi A X3 A 2:4) V ((xi V ^X2) A (x2 V ^x^))) . 

This formula was found by computer search. The claim now 
follows from Propositionl37l □ 



7.3 Collision problem 

In this section we look at the collision problem. This is 
a promise problem, where for an alphabet S the inputs x = 
X1X2 . . . x„ € S" satisfy one of the following conditions: 

• All Xi are different 

• For each i there exists exactly one j ^ i such that 

Those inputs satisfying the first condition are positive inputs 
and those satisfying the second condition are negative. An 
optimal lower bound for the quantum query complexity of 
n(n^/^) has been given by Aaronson and Shi |AS04|. We 
now show that the quantum adversary method cannot give 
better than a constant bound for this problem. 

Lemma 41 sumPI(fc) < 2 

Proof: We demonstrate a set of probability distributions 
Px,Py such that for any positive instance x and negative in- 
stance y we have 

The upper bound then follows. 

Our probability distribution is simple: for every x, let 
PxiS) be the uniform distribution over \n\. Any positive and 
negative instance must disagree in at least n/2 positions, so 



□ 

On the other hand, maxPI(fc) > \/n/2. As there is an 
upper bound for the collision problem of 0{'n}^'^) by Bras- 
sard, H0yer, Tapp |BHT97 |, this also shows that in general 
maxPI(/) is not a lower bound on the quantum query com- 
plexity of /. 

Lemma 42 maxPI(fc) = 6(\Ai) 

Proof: For the upper bound: On every positive instance 
X, where all Xi are different, we put the uniform distribu- 
tion over i £ [n]; for a negative instance y we put proba- 
bility 1/2 on the first position, and probability 1/2 on the 
position j such that yi = yj. As yi = yj, any posi- 
tive instance x must differ from y on position 1 or posi- 
tion j (or both). Thus ina.Xi,xi=;Lyi Px{'i)Pyi'i) > l/2n and 
maxPI(fc) < V^n. 

Now for the lower bound. Fix a set of probability distri- 
butions px- Let X be any positive instance. There must be 
at least n/2 positions i satisfying Px{i) < 2/n. Call this 
set of positions /. Now consider a negative instance y of 
where yj — Xj for all j ^ I, and y is assigned values in / 
in an arbitrary way so as to make it a negative instance. For 
this pair x, y we have max^ \/px{i)\/Py{i) < ^2/n, thus 
maxPI(fc) > ^/nj2. □ 



The following table summarizes the bounds from this 
section. 



Function 


Input 

size 


sum 
PI 




max 
PI 


L 


SqSi 


R-MAJ3 


N 
= 3" 


2''« 

^0.63 








^1.26 




N 
= 4" 


2.5''w 


IAmb03l 


<3''« 




N 


fc 


N 


2 




e(\/]v) 


N 


_L 



8 Conclusions and open problems 

Our new formula size lower bound techniques subsume 
many previous techniques, and for some functions they are 
provably better. A significant part of our intuition comes 
from quantum query complexity and Kolmogorov complex- 
ity. Measures sumPI and maxPI have many interesting 
properties and they connect different complexities such as 
quantum query complexity, classical formula size, classical 
probabilistic formula size and circuit depth. 

An outstanding open problem is whether the square of 
the quantum query complexity lower bounds the formula 
size. Another is that we do not know a nice dual expression 
for maxPI, and it does not seem to be a natural property 
in the sense of Razborov and Rudich. Thus the study of 



maxPI may lead us to a better understanding of complexity 
measures that themselves are hard to compute. We could 
reprove a key lemma of Hastad that leads to the best current 
formula size lower bound and we are hopeful that our tech- 
niques eventually will lead to improvements of the bounds 
in LHas 98|. 
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